Overview

Brought to you by YData

Dataset statistics

Number of variables11
Number of observations699
Missing cells0
Missing cells (%)0.0%
Duplicate rows8
Duplicate rows (%)1.1%
Total size in memory60.2 KiB
Average record size in memory88.2 B

Variable types

Numeric9
Categorical2

Alerts

Dataset has 8 (1.1%) duplicate rowsDuplicates
Bare Nuclei is highly overall correlated with ClassHigh correlation
Bland Chromatin is highly overall correlated with Class and 6 other fieldsHigh correlation
Class is highly overall correlated with Bare Nuclei and 8 other fieldsHigh correlation
Clump Thickness is highly overall correlated with Bland Chromatin and 6 other fieldsHigh correlation
Marginal Adhesion is highly overall correlated with Bland Chromatin and 6 other fieldsHigh correlation
Mitoses is highly overall correlated with Class and 2 other fieldsHigh correlation
Normal Nucleoli is highly overall correlated with Bland Chromatin and 7 other fieldsHigh correlation
Single Epithelial Cell Size is highly overall correlated with Bland Chromatin and 6 other fieldsHigh correlation
Uniformity of Cell Shape is highly overall correlated with Bland Chromatin and 6 other fieldsHigh correlation
Uniformity of Cell Size is highly overall correlated with Bland Chromatin and 7 other fieldsHigh correlation

Reproduction

Analysis started2025-01-02 19:11:42.536518
Analysis finished2025-01-02 19:11:50.493184
Duration7.96 seconds
Software versionydata-profiling vv4.12.1
Download configurationconfig.json

Variables

Sample code number
Real number (ℝ)

Distinct645
Distinct (%)92.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1071704.1
Minimum61634
Maximum13454352
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2025-01-02T20:11:50.595218image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum61634
5-th percentile411453
Q1870688.5
median1171710
Q31238298
95-th percentile1333890.8
Maximum13454352
Range13392718
Interquartile range (IQR)367609.5

Descriptive statistics

Standard deviation617095.73
Coefficient of variation (CV)0.57580794
Kurtosis257.71716
Mean1071704.1
Median Absolute Deviation (MAD)104381
Skewness13.675326
Sum7.4912116 × 108
Variance3.8080714 × 1011
MonotonicityNot monotonic
2025-01-02T20:11:50.750504image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1182404 6
 
0.9%
1276091 5
 
0.7%
1198641 3
 
0.4%
1158247 2
 
0.3%
1070935 2
 
0.3%
733639 2
 
0.3%
385103 2
 
0.3%
1212422 2
 
0.3%
798429 2
 
0.3%
1173347 2
 
0.3%
Other values (635) 671
96.0%
ValueCountFrequency (%)
61634 1
0.1%
63375 1
0.1%
76389 1
0.1%
95719 1
0.1%
128059 1
0.1%
142932 1
0.1%
144888 1
0.1%
145447 1
0.1%
160296 1
0.1%
167528 1
0.1%
ValueCountFrequency (%)
13454352 1
0.1%
8233704 1
0.1%
1371920 1
0.1%
1371026 1
0.1%
1369821 1
0.1%
1368882 1
0.1%
1368273 1
0.1%
1368267 1
0.1%
1365328 1
0.1%
1365075 1
0.1%

Clump Thickness
Real number (ℝ)

High correlation 

Distinct10
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.4177396
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2025-01-02T20:11:50.866503image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile10
Maximum10
Range9
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.8157407
Coefficient of variation (CV)0.63737135
Kurtosis-0.62371541
Mean4.4177396
Median Absolute Deviation (MAD)2
Skewness0.59285853
Sum3088
Variance7.9283955
MonotonicityNot monotonic
2025-01-02T20:11:50.962732image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 145
20.7%
5 130
18.6%
3 108
15.5%
4 80
11.4%
10 69
9.9%
2 50
 
7.2%
8 46
 
6.6%
6 34
 
4.9%
7 23
 
3.3%
9 14
 
2.0%
ValueCountFrequency (%)
1 145
20.7%
2 50
 
7.2%
3 108
15.5%
4 80
11.4%
5 130
18.6%
6 34
 
4.9%
7 23
 
3.3%
8 46
 
6.6%
9 14
 
2.0%
10 69
9.9%
ValueCountFrequency (%)
10 69
9.9%
9 14
 
2.0%
8 46
 
6.6%
7 23
 
3.3%
6 34
 
4.9%
5 130
18.6%
4 80
11.4%
3 108
15.5%
2 50
 
7.2%
1 145
20.7%

Uniformity of Cell Size
Real number (ℝ)

High correlation 

Distinct10
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.1344778
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2025-01-02T20:11:51.060777image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q35
95-th percentile10
Maximum10
Range9
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.0514591
Coefficient of variation (CV)0.97351434
Kurtosis0.098802885
Mean3.1344778
Median Absolute Deviation (MAD)0
Skewness1.2331366
Sum2191
Variance9.3114027
MonotonicityNot monotonic
2025-01-02T20:11:51.159945image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 384
54.9%
10 67
 
9.6%
3 52
 
7.4%
2 45
 
6.4%
4 40
 
5.7%
5 30
 
4.3%
8 29
 
4.1%
6 27
 
3.9%
7 19
 
2.7%
9 6
 
0.9%
ValueCountFrequency (%)
1 384
54.9%
2 45
 
6.4%
3 52
 
7.4%
4 40
 
5.7%
5 30
 
4.3%
6 27
 
3.9%
7 19
 
2.7%
8 29
 
4.1%
9 6
 
0.9%
10 67
 
9.6%
ValueCountFrequency (%)
10 67
 
9.6%
9 6
 
0.9%
8 29
 
4.1%
7 19
 
2.7%
6 27
 
3.9%
5 30
 
4.3%
4 40
 
5.7%
3 52
 
7.4%
2 45
 
6.4%
1 384
54.9%

Uniformity of Cell Shape
Real number (ℝ)

High correlation 

Distinct10
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.2074392
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2025-01-02T20:11:51.252225image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q35
95-th percentile10
Maximum10
Range9
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.9719128
Coefficient of variation (CV)0.9265687
Kurtosis0.00701098
Mean3.2074392
Median Absolute Deviation (MAD)0
Skewness1.1618592
Sum2242
Variance8.8322655
MonotonicityNot monotonic
2025-01-02T20:11:51.352557image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 353
50.5%
2 59
 
8.4%
10 58
 
8.3%
3 56
 
8.0%
4 44
 
6.3%
5 34
 
4.9%
7 30
 
4.3%
6 30
 
4.3%
8 28
 
4.0%
9 7
 
1.0%
ValueCountFrequency (%)
1 353
50.5%
2 59
 
8.4%
3 56
 
8.0%
4 44
 
6.3%
5 34
 
4.9%
6 30
 
4.3%
7 30
 
4.3%
8 28
 
4.0%
9 7
 
1.0%
10 58
 
8.3%
ValueCountFrequency (%)
10 58
 
8.3%
9 7
 
1.0%
8 28
 
4.0%
7 30
 
4.3%
6 30
 
4.3%
5 34
 
4.9%
4 44
 
6.3%
3 56
 
8.0%
2 59
 
8.4%
1 353
50.5%

Marginal Adhesion
Real number (ℝ)

High correlation 

Distinct10
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.806867
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2025-01-02T20:11:51.448792image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q34
95-th percentile10
Maximum10
Range9
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.8553792
Coefficient of variation (CV)1.0172834
Kurtosis0.98794707
Mean2.806867
Median Absolute Deviation (MAD)0
Skewness1.5244681
Sum1962
Variance8.1531906
MonotonicityNot monotonic
2025-01-02T20:11:51.539792image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 407
58.2%
3 58
 
8.3%
2 58
 
8.3%
10 55
 
7.9%
4 33
 
4.7%
8 25
 
3.6%
5 23
 
3.3%
6 22
 
3.1%
7 13
 
1.9%
9 5
 
0.7%
ValueCountFrequency (%)
1 407
58.2%
2 58
 
8.3%
3 58
 
8.3%
4 33
 
4.7%
5 23
 
3.3%
6 22
 
3.1%
7 13
 
1.9%
8 25
 
3.6%
9 5
 
0.7%
10 55
 
7.9%
ValueCountFrequency (%)
10 55
 
7.9%
9 5
 
0.7%
8 25
 
3.6%
7 13
 
1.9%
6 22
 
3.1%
5 23
 
3.3%
4 33
 
4.7%
3 58
 
8.3%
2 58
 
8.3%
1 407
58.2%

Single Epithelial Cell Size
Real number (ℝ)

High correlation 

Distinct10
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.2160229
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2025-01-02T20:11:51.634853image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q34
95-th percentile8
Maximum10
Range9
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.2142999
Coefficient of variation (CV)0.68852118
Kurtosis2.1690664
Mean3.2160229
Median Absolute Deviation (MAD)0
Skewness1.7121718
Sum2248
Variance4.903124
MonotonicityNot monotonic
2025-01-02T20:11:51.726854image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2 386
55.2%
3 72
 
10.3%
4 48
 
6.9%
1 47
 
6.7%
6 41
 
5.9%
5 39
 
5.6%
10 31
 
4.4%
8 21
 
3.0%
7 12
 
1.7%
9 2
 
0.3%
ValueCountFrequency (%)
1 47
 
6.7%
2 386
55.2%
3 72
 
10.3%
4 48
 
6.9%
5 39
 
5.6%
6 41
 
5.9%
7 12
 
1.7%
8 21
 
3.0%
9 2
 
0.3%
10 31
 
4.4%
ValueCountFrequency (%)
10 31
 
4.4%
9 2
 
0.3%
8 21
 
3.0%
7 12
 
1.7%
6 41
 
5.9%
5 39
 
5.6%
4 48
 
6.9%
3 72
 
10.3%
2 386
55.2%
1 47
 
6.7%

Bare Nuclei
Categorical

High correlation 

Distinct11
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
1
402 
10
132 
2
 
30
5
 
30
3
 
28
Other values (6)
77 

Length

Max length2
Median length1
Mean length1.1888412
Min length1

Characters and Unicode

Total characters831
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row10
3rd row2
4th row4
5th row1

Common Values

ValueCountFrequency (%)
1 402
57.5%
10 132
 
18.9%
2 30
 
4.3%
5 30
 
4.3%
3 28
 
4.0%
8 21
 
3.0%
4 19
 
2.7%
? 16
 
2.3%
9 9
 
1.3%
7 8
 
1.1%

Length

2025-01-02T20:11:51.833855image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1 402
57.5%
10 132
 
18.9%
2 30
 
4.3%
5 30
 
4.3%
3 28
 
4.0%
8 21
 
3.0%
4 19
 
2.7%
16
 
2.3%
9 9
 
1.3%
7 8
 
1.1%

Most occurring characters

ValueCountFrequency (%)
1 534
64.3%
0 132
 
15.9%
2 30
 
3.6%
5 30
 
3.6%
3 28
 
3.4%
8 21
 
2.5%
4 19
 
2.3%
? 16
 
1.9%
9 9
 
1.1%
7 8
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 815
98.1%
Other Punctuation 16
 
1.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 534
65.5%
0 132
 
16.2%
2 30
 
3.7%
5 30
 
3.7%
3 28
 
3.4%
8 21
 
2.6%
4 19
 
2.3%
9 9
 
1.1%
7 8
 
1.0%
6 4
 
0.5%
Other Punctuation
ValueCountFrequency (%)
? 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 831
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 534
64.3%
0 132
 
15.9%
2 30
 
3.6%
5 30
 
3.6%
3 28
 
3.4%
8 21
 
2.5%
4 19
 
2.3%
? 16
 
1.9%
9 9
 
1.1%
7 8
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 831
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 534
64.3%
0 132
 
15.9%
2 30
 
3.6%
5 30
 
3.6%
3 28
 
3.4%
8 21
 
2.5%
4 19
 
2.3%
? 16
 
1.9%
9 9
 
1.1%
7 8
 
1.0%

Bland Chromatin
Real number (ℝ)

High correlation 

Distinct10
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.4377682
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2025-01-02T20:11:51.928817image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum10
Range9
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.4383643
Coefficient of variation (CV)0.70928698
Kurtosis0.18462131
Mean3.4377682
Median Absolute Deviation (MAD)1
Skewness1.0999691
Sum2403
Variance5.9456202
MonotonicityNot monotonic
2025-01-02T20:11:52.027067image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2 166
23.7%
3 165
23.6%
1 152
21.7%
7 73
10.4%
4 40
 
5.7%
5 34
 
4.9%
8 28
 
4.0%
10 20
 
2.9%
9 11
 
1.6%
6 10
 
1.4%
ValueCountFrequency (%)
1 152
21.7%
2 166
23.7%
3 165
23.6%
4 40
 
5.7%
5 34
 
4.9%
6 10
 
1.4%
7 73
10.4%
8 28
 
4.0%
9 11
 
1.6%
10 20
 
2.9%
ValueCountFrequency (%)
10 20
 
2.9%
9 11
 
1.6%
8 28
 
4.0%
7 73
10.4%
6 10
 
1.4%
5 34
 
4.9%
4 40
 
5.7%
3 165
23.6%
2 166
23.7%
1 152
21.7%

Normal Nucleoli
Real number (ℝ)

High correlation 

Distinct10
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.8669528
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2025-01-02T20:11:52.123029image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q34
95-th percentile10
Maximum10
Range9
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.0536339
Coefficient of variation (CV)1.0651148
Kurtosis0.47426868
Mean2.8669528
Median Absolute Deviation (MAD)0
Skewness1.4222613
Sum2004
Variance9.32468
MonotonicityNot monotonic
2025-01-02T20:11:52.217068image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 443
63.4%
10 61
 
8.7%
3 44
 
6.3%
2 36
 
5.2%
8 24
 
3.4%
6 22
 
3.1%
5 19
 
2.7%
4 18
 
2.6%
7 16
 
2.3%
9 16
 
2.3%
ValueCountFrequency (%)
1 443
63.4%
2 36
 
5.2%
3 44
 
6.3%
4 18
 
2.6%
5 19
 
2.7%
6 22
 
3.1%
7 16
 
2.3%
8 24
 
3.4%
9 16
 
2.3%
10 61
 
8.7%
ValueCountFrequency (%)
10 61
 
8.7%
9 16
 
2.3%
8 24
 
3.4%
7 16
 
2.3%
6 22
 
3.1%
5 19
 
2.7%
4 18
 
2.6%
3 44
 
6.3%
2 36
 
5.2%
1 443
63.4%

Mitoses
Real number (ℝ)

High correlation 

Distinct9
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5894134
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2025-01-02T20:11:52.308067image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile5
Maximum10
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.7150779
Coefficient of variation (CV)1.0790634
Kurtosis12.657878
Mean1.5894134
Median Absolute Deviation (MAD)0
Skewness3.5606578
Sum1111
Variance2.9414923
MonotonicityNot monotonic
2025-01-02T20:11:52.404299image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
1 579
82.8%
2 35
 
5.0%
3 33
 
4.7%
10 14
 
2.0%
4 12
 
1.7%
7 9
 
1.3%
8 8
 
1.1%
5 6
 
0.9%
6 3
 
0.4%
ValueCountFrequency (%)
1 579
82.8%
2 35
 
5.0%
3 33
 
4.7%
4 12
 
1.7%
5 6
 
0.9%
6 3
 
0.4%
7 9
 
1.3%
8 8
 
1.1%
10 14
 
2.0%
ValueCountFrequency (%)
10 14
 
2.0%
8 8
 
1.1%
7 9
 
1.3%
6 3
 
0.4%
5 6
 
0.9%
4 12
 
1.7%
3 33
 
4.7%
2 35
 
5.0%
1 579
82.8%

Class
Categorical

High correlation 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
benign
458 
malignant
241 

Length

Max length9
Median length6
Mean length7.0343348
Min length6

Characters and Unicode

Total characters4917
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowbenign
2nd rowbenign
3rd rowbenign
4th rowbenign
5th rowbenign

Common Values

ValueCountFrequency (%)
benign 458
65.5%
malignant 241
34.5%

Length

2025-01-02T20:11:52.511467image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-01-02T20:11:52.611745image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
ValueCountFrequency (%)
benign 458
65.5%
malignant 241
34.5%

Most occurring characters

ValueCountFrequency (%)
n 1398
28.4%
g 699
14.2%
i 699
14.2%
a 482
 
9.8%
b 458
 
9.3%
e 458
 
9.3%
m 241
 
4.9%
l 241
 
4.9%
t 241
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4917
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 1398
28.4%
g 699
14.2%
i 699
14.2%
a 482
 
9.8%
b 458
 
9.3%
e 458
 
9.3%
m 241
 
4.9%
l 241
 
4.9%
t 241
 
4.9%

Most occurring scripts

ValueCountFrequency (%)
Latin 4917
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 1398
28.4%
g 699
14.2%
i 699
14.2%
a 482
 
9.8%
b 458
 
9.3%
e 458
 
9.3%
m 241
 
4.9%
l 241
 
4.9%
t 241
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4917
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 1398
28.4%
g 699
14.2%
i 699
14.2%
a 482
 
9.8%
b 458
 
9.3%
e 458
 
9.3%
m 241
 
4.9%
l 241
 
4.9%
t 241
 
4.9%

Interactions

2025-01-02T20:11:49.285789image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:42.949307image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:43.907990image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.672668image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.432785image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.224299image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.981697image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.769290image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.527980image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.383493image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:43.079523image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.010778image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.772768image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.529909image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.322260image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.078853image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.872321image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.621980image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.464275image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:43.176286image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.096818image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.853760image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.613605image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.404260image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.163149image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.953324image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.703942image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.551731image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:43.269327image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.178684image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.935766image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.695780image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.486297image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.247899image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.036320image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.785980image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.632929image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:43.358527image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.260720image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.019008image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.780434image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.569653image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.333898image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.118235image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.871212image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.716260image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:43.451557image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.343911image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.105008image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.864414image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.650857image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.416284image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.201230image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.952687image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.797496image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:43.540527image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.426902image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.185718image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.977661image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.734599image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.499284image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.284655image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.038789image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.880610image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:43.633732image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.508283image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.268618image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.061625image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.818599image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.583324image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.366762image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.122819image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.961567image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:43.723714image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:44.590546image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:45.351562image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.145033image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:46.898697image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:47.682292image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:48.446753image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
2025-01-02T20:11:49.204822image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Correlations

2025-01-02T20:11:52.684745image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Bare NucleiBland ChromatinClassClump ThicknessMarginal AdhesionMitosesNormal NucleoliSample code numberSingle Epithelial Cell SizeUniformity of Cell ShapeUniformity of Cell Size
Bare Nuclei1.0000.2550.8340.2230.2630.1940.2510.0000.2700.2780.287
Bland Chromatin0.2551.0000.8040.5380.6250.3870.662-0.0960.6400.6920.719
Class0.8340.8041.0000.7380.7380.5190.7680.0000.7910.8600.875
Clump Thickness0.2230.5380.7381.0000.5420.4190.570-0.0040.5840.6640.666
Marginal Adhesion0.2630.6250.7380.5421.0000.4470.634-0.0500.6680.7120.743
Mitoses0.1940.3870.5190.4190.4471.0000.504-0.0750.4800.4730.509
Normal Nucleoli0.2510.6620.7680.5700.6340.5041.000-0.0710.7060.7250.757
Sample code number0.000-0.0960.000-0.004-0.050-0.075-0.0711.000-0.087-0.060-0.043
Single Epithelial Cell Size0.2700.6400.7910.5840.6680.4800.706-0.0871.0000.7590.787
Uniformity of Cell Shape0.2780.6920.8600.6640.7120.4730.725-0.0600.7591.0000.892
Uniformity of Cell Size0.2870.7190.8750.6660.7430.5090.757-0.0430.7870.8921.000

Missing values

2025-01-02T20:11:50.222723image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
A simple visualization of nullity by column.
2025-01-02T20:11:50.402630image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Sample code numberClump ThicknessUniformity of Cell SizeUniformity of Cell ShapeMarginal AdhesionSingle Epithelial Cell SizeBare NucleiBland ChromatinNormal NucleoliMitosesClass
01000025511121311benign
110029455445710321benign
21015425311122311benign
31016277688134371benign
41017023411321311benign
51017122810108710971malignant
610180991111210311benign
71018561212121311benign
81033078211121115benign
91033078421121211benign
Sample code numberClump ThicknessUniformity of Cell SizeUniformity of Cell ShapeMarginal AdhesionSingle Epithelial Cell SizeBare NucleiBland ChromatinNormal NucleoliMitosesClass
689654546111121118benign
690654546111321111benign
69169509151010545441malignant
692714039311121111benign
693763235311121212benign
694776715311132111benign
695841769211121111benign
696888820510103738102malignant
6978974714864341061malignant
6988974714885451041malignant

Duplicate rows

Most frequently occurring

Sample code numberClump ThicknessUniformity of Cell SizeUniformity of Cell ShapeMarginal AdhesionSingle Epithelial Cell SizeBare NucleiBland ChromatinNormal NucleoliMitosesClass# duplicates
03206753352310711malignant2
1466906111121111benign2
2704097111111211benign2
31100524610102810733malignant2
41116116910101108331malignant2
51198641311121311benign2
61218860111111311benign2
71321942511121311benign2